Nearest Neighbor Search in Multidimensional Spaces Depth Oral Report

نویسنده

  • Panayiotis Tsaparas
چکیده

The Nearest Neighbor Search problem is deened as follows: given a set P of n points, preprocess the points so as to eeciently answer queries that require nding the closest point in P to a query point q. If we are willing to settle for a point that is almost as close as the nearest neighbor, then we can relax the problem to the approximate Nearest Neighbor Search. Nearest Neighbor Search (exact or approximate) is an integral component in a wide range of applications that include multimedia databases, computational biology, data mining, and information retrieval. The common thread in all these applications is similarity search: given a database of objects, we want to return the object in the database that is most similar to a query object. The objects are mapped onto points in a high dimensional metric space , and similarity search reduces to a nearest neighbor search. The dimension of the underlying space may be in the order of a few hundreds, or thousands; therefore, we require algorithms that perform eeciently even for spaces of high dimension. Due to its importance, the Nearest Neighbor Search problem has been a subject of research for many decades, and in many diierent elds. In this work we survey some of the past and recent results that marked the evolution of the problem. The report starts with the optimal solutions for low-dimensional spaces, then it goes through the solutions for the exact and approximate problem for spaces of arbitrary dimension, and then it nishes oo with the recent attempts to prove lower bounds in order to specify the hardness of the problem. We also investigate the problem from the practitioner's point of view, and we provide an overview of the indexing problem for Nearest Neighbor Queries. Finally, we identify some open questions, and interesting problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nearest Neighbor Analysis of Psychological Spaces

Geometric models impose an upper bound on the number of points that can share the same nearest neighbor. A much more restrictive bound is implied by the assumption that the data points represent a sample from some continuous distribution in a multidimensional Euclidean space. The analysis of 100 data sets shows that most perceptual data satisfy the geometric-statistical bound whereas many conce...

متن کامل

An efficient nearest neighbor search in high-dimensional data spaces

Similarity search in multimedia databases requires an efficient support of nearest neighbor search on a large set of high-dimensional points. A technique applied for similarity search in multimedia databases is to transform important properties of the multimedia objects into points of a high-dimensional feature space. The feature space is usually indexed using a multidimensional index structure...

متن کامل

Incremental Reverse Nearest Neighbor Ranking in Vector Spaces

In this paper, we formalize the novel concept of incremental reverse nearest neighbor ranking and suggest an original solution for this problem. We propose an efficient approach for reporting the results incrementally without the need to restart the search from scratch. Our approach can be applied to a multidimensional feature database which is hierarchically organized by any R-tree like index ...

متن کامل

Fast Nearest-Neighbor Search Algorithms Based on High-Multidimensional Data

Similarity search in multimedia databases requires an efficient support of nearest-neighbor search on a large set of high-dimensional points as a basic operation for query processing. As recent theoretical results show, state of the art approaches to nearest-neighbor search are not efficient in higher dimensions. In our new approach, we therefore pre-compute the result of any nearest-neighbor s...

متن کامل

Ε-isa: an Incremental Lower Bound Approach for Efficiently Finding Approximate Nearest Neighbor of Complex Vague Queries

In our context, a complex vague query means a multifeature nearest neighbor query. Answering such queries requires the system to search on some feature spaces individually and then combine the searching results to find the final answers. The feature spaces are commonly multidimensional spaces and may consist of a vast amount of data. Therefore searching costs, including IO-cost and CPU-cost, ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999